A Comparative Study on Linguistic Feature Selection in Sentiment Polarity Classification

نویسنده

  • Zitao Liu
چکیده

Sentiment polarity classification is perhaps the most widely studied topic. It classifies an opinionated document as expressing a positive or negative opinion. In this paper, using movie review dataset, we perform a comparative study with different single kind linguistic features and the combinations of these features. We find that the classic topicbased classifier(Naive Bayes and Support Vector Machine) do not perform as well on sentiment polarity classification. And we find that with some combination of different linguistic features, the classification accuracy can be boosted a lot. We give some reasonable explanations about these boosting outcomes.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sentiment Classification and Polarity Shifting

Polarity shifting marked by various linguistic structures has been a challenge to automatic sentiment classification. In this paper, we propose a machine learning approach to incorporate polarity shifting information into a document-level sentiment classification system. First, a feature selection method is adopted to automatically generate the training data for a binary classifier on polarity ...

متن کامل

GermanPolarityClues: A Lexical Resource for German Sentiment Analysis

In this paper, we propose GermanPolarityClues, a new publicly available lexical resource for sentiment analysis for the German language. While sentiment analysis and polarity classification has been extensively studied at different document levels (e.g. sentences and phrases), only a few approaches explored the effect of a polarity-based feature selection and subjectivity resources for the Germ...

متن کامل

An Evaluation of Sentiment Analysis and Classification Algorithms for Arabic Textual Data

Sentiment analysis is a recent advance in text mining applications for analyzing textual data according to orientation of human comments to determine whether they are positive, negative, or neutral. Different data mining techniques and algorithms such as support vector machine, naïve Bayes, decision tree, k-nearest neighbor and other techniques are used for analyzing textual data. These techniq...

متن کامل

Improving Document-Level Sentiment Classification Using Contextual Valence Shifters

Traditional sentiment feature extraction methods in documentlevel sentiment classification either count the frequencies of sentiment words as features, or the frequencies of modified and unmodified instances of each of these words. However, these methods do not represent the sentiment words’ linguistic context efficiently. We propose a novel method and feature set to handle the contextual polar...

متن کامل

Finding Domain Specific Polar Words for Sentiment Classification

This paper presents a method of using conditional random fields (CRF) for extracting polar words and determining the overall sentiment of text. We frame sentiment classification as a feature selection problem and conduct three sets of experiments by using: prior polarity lexicons, bag-of-words classifiers and CRF sequence models. The results show the potential of utilizing CRFs in discovering h...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1311.0833  شماره 

صفحات  -

تاریخ انتشار 2013